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Abstract. In our understanding, a mind-map is an adaptive en- 
gine that basically works incrementally on the fundament of ex- 
isting transactional streams. Generally, mind-maps consist of sym- 
bolic cells that are connected with each other and that become either 
stronger or weaker depending on the transactional stream. Based on 
the underlying biologic principle, these symbolic cells and their con- 
nections as well may adaptively survive or die, forming different cell 
agglomerates of arbitrary size. In this work, we intend to prove mind- 
maps' eligibility following diverse application scenarios, for exam- 
ple being an underlying management system to represent normal and 
abnormal traffic behaviour in computer networks, supporting the de- 
tection of the user behaviour within search engines, or being a hidden 
communication layer for natural language interaction. 

1 ABOUT MIND-MAPS 

Transactional streams are to be understood as an endless flow of data 
that is lost once it is read. Furthermore, the data can be classified into 
categories, for example sentences or paragraphs inside a text doc- 
ument. These categories form the transaction, having items inside, 
for example words or paraphrases. An example for a transactional 
stream may be the reading of a book, where the text is read exactly 
once but lost if it is pronounced. The management, and moreover, 
the analysis of transactional streams is often problematic for several 
reasons. One of them is that data streams are potentially infinite or 
at least their end is not known until it is actually reached. Storing 
the whole data stream is therefore not an option, and the analysis 
cannot rely on traditional mining techniques that require the whole 
dataset to be available or that need random access or multiple passes 
over the data. Currently a lot of research focuses on the processing of 
such streams of different kind. Some of the typical techniques used 
with data streams are sliding windows, incremental approaches, or 
synopses of the data. Surveys of current methods and issues can be 
found in 14|, |7|, |9|, iTSl. 

With the discussion around mind-maps, we argue for its eligi- 
bility by demonstrating its applicability on a couple of algorithmic 
ideas. In our understanding, mind-maps are to be seen as adap- 
tive and incremental knowledge structures, which live from depend- 
ing on the occurrence of an input stream. A first approach in stream 
data analysis with mind-maps had been done in the processing of 
transactional streams with the creation of mini-networks. These 
base on transactional data ( II9I ). the mini-network consists of simple 
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symbolic cells that share a weight value and that represent an individ- 
ual item in a transaction. The symbolic cells are interconnected with 
other cells that occur in the transaction as well. In a subsequent step, 
the mini-network becomes integrated to the mind-map itself, where 
those cells become merged with those in the mind-map in case that 
they are identical (merge). Using the fundamental principle of adap- 
tation and Hebbian Learning, the mind-map can be seen as a living 
engine as it is initially empty but grows over time. Since the states of 
connections and cells change over time the cells may die or revive as 
well. Focusing on the skeleton of the mind-map, a retrieve yields 
on delivering the strongest cell connections. 

Although mind-maps often refer to such a structure to process 
fluid signals like text streams, mind-maps often claim to have its as- 
sociative nature as the fundamental principle. This is true, however, 
we believe that a temporal character of such systems may be accepted 
as well to manage temporal stimulation that comes in. Secondly, 
mind-maps can be seen both from a verificative and an explorative 
perception: talking about mind-map software mostly refers to a top- 
down directed production of logically connected entities, for exam- 
ple work-flows or coherent cogitations (O, (8), 1101 '). whereas the 
word of explorative more related to a learning or discovering 
process of such logical structures. In this respect, the following ap- 
plications refers to the second class of mind-maps and present some 
algorithmic ideas for the temporal processing of text streams to map 
contextual information. A simple implementation of mind-maps is 
the system ANIMA. It refers to a mind-map model of incremental and 
adaptive nature and allows to manage associations between symbolic 
cells while having a transaction stream as input. The aim of ANIMA 
is the efficient processing and management of transactions over time, 
to present related patterns inside a stream II9I . 

2 APPLICATIONS 

2.1 TARGETING NETWORK PROTECTION 

Network-based anomaly detection II II 1211 refers to a system-based 
understanding of what the structure and the behaviour of network 
traffic is, and in this respect to identify abnormal situations. Here, 
the mind-map model ANIMA-AR is implemented to represent net- 
work traffic events while having network packets including header 
and content as one transaction, is one promising application. It frag- 
ments such packet transactions into meaningful symbolic cells within 
the mind-map and connects the packet cells. Additionally, connec- 
tion values are established relating to the corresponding frequencies, 
respectively. Due to the architecture, we may rate the usual network 
traffic by a lower connection weight - although the frequency is high 
- but may rate an abnormal/unusual network behaviour by a higher 



value - although it appears more seldom. We identify the abnormal 
traffic as it is rated significantly and therefore tolerant against tem- 
poral connection updates between the symbolic cells. The follow- 
ing sequence of pictures gives an example on how the mind-map is 
used; it refers to the management of bad signatures as described in 
fm, taking several insertion rules into account. First, the incoming 
signature ABC is considered and assigned to symbolic cells, being 
equal-weighted. 
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Then, if a new signature CDEF is added to the mini-network, and 
the weights are adapted. At each step, substrings are considered and 
evaluated as follows: given a sub-string, then if the sum of all activa- 
tion states . . . 

• ... is exactly 1, then a virus alert takes place. 

• ... is exactly 0, then no virus alert takes place. 

• ... is between and 1, an alert takes place with a probability value. 

• ... increases 1 , then it is not considered. 

For example, ABC forces a virus alert as it is infected for 100%, 
whereas only H is unlikely to be infected. To continue the example, 
as C is already known, this value stays constantly. 

©-©-@-@-®-© 

The following photographs of the mind-map refers to the situation 
when the signatures CDE 




and CDGH are being inserted. With this, the probability values for 
each signatures is clearly available through the whole life-time of the 
mind-map. More information can be found in [ iTI . 
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The mind-map model ANIMA-AR is implemented to detect well- 
known viruses. The signature of viruses is stored in a graph-like 
structure; virus signatures are managed and stored, incoming packets 
- to identify intrusions - evaluated. The scanning speed and the re- 
quired storage space outperforms current approaches and emerges 
out of the compression of the signature database. ANIMA-AR is 
theoretically analysed showing that viruses and similarities are de- 
tected. Simulations substantiate the theoretical analysis and show 
the low false-positive rate tolerating the normal system. In addition, 
ANIMA-AR is able to automatically detect similar viruses as small 
mutations or new variants. 

2.2 MANAGING IMPLICIT FEEDBACK 

The consideration of implicit feedback in the field of information re- 
trieval and the automatic collection of information about the user's 



behaviour f 151 r24'| is an application of interest. Without an explic- 
itly request of information, we intend to gain some information about 
what really interesting is. The aim is to use the mind-map as an adap- 
tive storage for such kind of information and consequently, for the 
enhancement of user-based research requests. We therefore moni- 
tor the user's behaviour in interaction with a search engine, keeping 
an eye on queries and their results, links that the user follows, and 
diverse time-related information, for example how long he or she 
stays on web sites. Search sessions like this are considered as sin- 
gle transactions for the mind-map. And, building up such a network 
provides information about typical queries, results, and measures, es- 
pecially regarding the relevance of search results, namely towards an 
enhancement of further queries. 

An example might be a giving of additional hints or altering the re- 
sults themselves. One concrete step is a re-ranking of search results, 
which may help to place those results further on the top of the list that 
are probably of greater importance according to the given query. Fol- 
lowing this, the strengthens of the mind-map are mainly due to the 
ability to cope with transactional streaming data. We concern with 
information about search sessions of users, which can easily be bro- 
ken down into transactions. Moreover, the mind-map is able to store 
only the most important aspects of the information without the need 
of storing all feedback data. This helps to keep the network at a rea- 
sonable size. Furthermore, the dynamic nature of this mind-map fits 
quite well to the purpose of this approach: if there is a change of 
the user's feedback over time, then this trend will be reflected in the 
mind-map as well. Figure [T] shows an architectural snapshot of the 
mind-map, where we use three different types of cells: 

• The query terms are single terms that are observed in user queries. 

• Queries that form the transactional input to the mind-map. 

• The resulting list of documents that have been provided by the 
underlying search engine for one or more queries. 




Figure 1. Ai'chitecture of the mind-map 1 24 



There may exist different connections between these nodes, which 
might be weighted indicating the strength of the relationship. These 
are for example connections that indicate relationships between dif- 
ferent query terms, connections between queries and query terms, 
and connections between queries and documents. More information 
can be found in |24J \25i . 

2.3 TARGETING DBLP 

Bibliographical databases such as Citeseer, Google 
Scholar, and DBLP serve as a bibliographic source with lots 
of information concerning a publication. This compounds the names 
of the authors, the publication title, the conference, and many other 
attributes. A bibliography database is accessible online, where all 



entries share an electronic index to articles, journals, magazines, etc. 
containing citations and abstracts. Understanding the bibliographic 
database as a digital collection that is intelligently managed and that 
supports a search for and the retrieve of bibliographic information 
to defined queries is a convenient procedure in demanding infor- 
mation right in time. Regularly, the retrieve bases on a collection 
of queries that consists of keywords in the publication title or the 
keyword list. For example with DBLP, the querying using a keyword 
plagiarism leads to an answer set of almost 70 articles, and a 
search refinement with detection and pattern to 34 and 2 
bibliographic entries, respectively. Accordingly, the two referenced 
publications pledge close; and the names of the authors overlap: 

1 NamOh Kang, Sang-Yong Han: Document Copy 
Detection System Based on Plagiarism Patterns. 
CICLing 2006: 571-574. 

2 NamOh Kang, Alexander F. Gelbukh, Sang-Yong 
Han: PPChecker: Plagiarism Pattern Checker in 
Document Copy Detection. TSD 2006: 661-667. 

With this, we reference to a graph structure representing the as- 
sociation of the three authors Han, Kang and Gelbukh. This is 
similar to 1*121 who introduces mini-networks. The double edges sig- 
nalise a double connection as the single (and parallel) edge between 
Han to Gelbukh and Kang to Gelbukh refers to a one-way asso- 
ciation. Whereas the meaning of a double connection is unique while 
having two publications, the double entries to Gelbukh seem to be 
ambiguous: on the one side, it refers to one common publication with 
both individually, on the other to two single publications with one of 
the other authors. However, the graph is node-oriented in a way that 
it simply represents the situation as it is: Gelbukh has one common 
publication with both of them, and here, it plays no role if this is a 
common one or not. 

For static databases, the discovery of associative patterns has been 
an area of extensive research, and multiple approaches and solutions 
to the static problem have been presented in the past. A major prob- 
lem in these approaches is the combinatorial explosion of the search 
space and the research has therefore mainly focused on reducing this 
space. Since mind-maps are targeted to data streams, it can not pos- 
sibly make use of the methods developed for static databases, since 
these algorithms require multiple passes over the data to calculate 
frequencies of associated items. This is indeed not acceptable when 
dealing with data streams, because data streams are potentially in- 
finite, or at the very least their end cannot be foreseen. In this re- 
spect, the idea of searching for temporal patterns in a bibliographic 
database like DBLP while taking the time as the core medium leads 
to a couple of interesting questions, for example 

• In general, may we discover scientific communities? While ob- 
serving the visualisation of associative relationships between au- 
thors, we might ask if such dependencies generally form a com- 
munity, and secondly, how strong these communities may be. Fur- 
thermore, if dependencies of of communities exist, are these tem- 
poral or visiting, recurring, or constant as it has been mentioned 
in section[T]? 

• Do there exist diverse trends in publishing? For example, the oc- 
currence of a common publication may be the initiator for a fruit- 
ful collaboration (which is proven by following publications on 
the same or a different research topic) 
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Figure 2. Temporary mind-map (DBLP, year of 1993). consisting of asso- 
ciated author nodes. 

2.4 SEMANTIC NET-LEARNING 

The mind-map model WYWI stands for a simple communication 
paradigm that focus on natural language communication. The model 
uses a mind-map to manage words and their relationship to others 
associatively. A sentence is read incrementally and threatened as a 
transaction with concepts and roles. Currently, only adjectives, 
nouns, and verbs are considered as worth, they are extracted and put 
into the mind-map as a semantic structure. Adjectives are considered 
as sub-concepts of nouns. For example, 

#S (CONCEPT :NAME MAN : CAT N : FATHER (ROOT) 

: CHILDREN (YOUNG) : ROLES (READ) :ACT 0.9577) 

#S(ROLE :NAME READ : CAT V :CONNECTION ((MAN 
BOOK 0.9577)) :ACT 0.9577) 

#S(ROLE :NAME SEE : CAT V :CONNECTION ((LION 
PETER 0.8253)) :ACT 0.9014) 

Here, the word READ acts as relationship (role) connects both 
MAN and BOOK, sharing a connection and activation value of 0.9577, 
respectively. Other roles exist, but are unrelated. Each time-step, the 
activation values are decreased unless the role or the associated con- 
cepts remain inactive. As for the ANIMA mind-map model, the under- 
lying net structure is refreshed after each transaction; it adapts to the 
given stimulation. However, WYWI does not use any extern source 
(for example thesaurus) but actively interacts with the user in case 
a word is unknown. This is to understand the morphologic basis or 
the lexical category. WYWI also establishes a semantic representation 
through micro-features: here, the user must rate a word semantically 
against given (predefined) entities. This produces a k-dimensional 
space, having k as the number of independent micro-features, for ex- 
ample 



MAN is ((1.0 0.0 1.0 0.0 0.0 0.0 0.0 1.0 ...)) 
LION is ((0.0 0.0 0.0 0.0 0.7 0.0 0.4 ...)) 



With this, we may perform mind-mapping over a period of time, 
for example moving a corresponding window temporarily over time. 



The communication in WYWI is performed as a conversation. 
For a sentence like The yellow lion sees Peter, the word 



YELLOW becomes a sub-concept of a word LION, whereas LION 
and PETER are connected by SEE. Additionally, SEE is becomes 
higher activated. Additionally, all concepts are represented in the k- 
space. As a consequence, transactional (dialogue) stream can be pro- 
cessed and queries be sent at each time. 

WYWI: (ENTER THE NEXT WORD OR CLOSE BY ' ) 
USER: Peter 

WYWI: COULD YOU PLEASE GIVE SOME EXPLANATION, 
LIKE THE MORPHOLOGIC BASIS? 

USER: Peter 

WYWI: AND WHAT IS THE LEXICAL BASIS OF peter? 
USER: person 

WYWI: I NEED SOME SEMANTIC INFORMATION ABOUT 
peter. HAVE A LOOK AND RATE peter FROM 
to 1 : 
TALL? 

USER: 0.9 

WYWI: And with HEAVY? 

An extended version of the semantic net-learning from text 
streams is to define a temporal mind-map for a certain actor of a 
text: where the text stream is consolidated over time by a seman- 
tic graph structure. In this work, an important issue for building an 
author-related mind-map is the resolution of linguistic anaphors. 

So far, the major idea is to process each sentence incrementally 
so that a general and pre-defined scheme structure of sub ject - 
verb - ob j ect is instantiated every time (| 14|). At the moment, 
we only concern with nouns, verbs, and adjectives, where other lin- 
guistic categories are disregarded. In this respect, verbs act as a role 
between the concepts (noun, object). Each concept may have sub- 
concepts that correspond to attributes, for example the sentence the 
old man likes the green juicy grassland is trans- 
lated into 




Figure 3. A semantic graph having man as the centric concept. Attributes 
are adjunct by dashed lines, whereas roles are connected as a circle. 




Figure 4. The anaphor he is recognized as to be related to man, whereas 
the two sentence structures are merged by the concept man. 

where man and grassland represent main concepts and 
green, juicy, and old sub-concepts. The process of relating 



identical concepts together is accomplished by a matching of iden- 
tical words and a resolution of linguistic anaphors (| 13), |17| ). The 
following Figure[4]shows the semantic structure after having read the 
sentence He walks into the forest. 

So far, the incremental processing of texts can be stopped at any 
moment. The semantic structure is a mind-map with concepts and 
relationships of consistent states. Beside focusing on the elaboration 
of methods to realize pronominal anaphora and co-reference in text 
streams, we will assign weight values to concepts and roles in order 
to prove their importance for an actor. 

The idea of content zoning is to refer to a segmentation of a text 
document into semantic zones. As indicated in |3| and moreover as 
firstly discussed in 1221 with Argumentative Zoning, the basic 
idea here is to structure texts on the basis on pre-defined categories. 
An example might be the following text, having the actors Harry, 
Hedwig, and owl: 

"Harry got up off the floor, stretched, and moved across to his 
desk. Hedwig made no movement as she began to flick through news- 
papers, throwing them into the rubbish pile one by one. The owl was 
asleep or else faking; she was angry with Harry about the limited 
amount of time she was allowed out of her cage at the moment. As 
her neared the bottom of the pile of newspapers. Harry slowed down, 
searching for one particular issue that he knew had arrived shortly 
after he had returned to Privet Drive for the summer, he remembered 
that there had been a small mention on the front about the resignation 
of Charity Burbage, the Muggle Studies teacher at Hogwarts." 

an then be zoned to 

• Harry <ACTOR=HARRY> got up off the floor, stretched, 
and moved across to his (^his => HARRY) desk. 
</ACTOR=HARRY> 

• Hedwig <ACTOR=HEDWIG> made no movement as she (^she 
= HEDWIG) began to flick through newspapers, throwing them 
(—^them = newspaperss) into the rubbish pile one by one. 

• The owl <ACTOR=OWL> was asleep or else faking: she (^she 
= OWL) was angry with Harry (-^Harry = HARRY) about the 
limited amount of time she (^she = OWL) was allowed out of 
her ( —^her OWL) cage at the moment. As her ( ^her =^ OWL) 
neared the bottom of the pile of newspapers, </ACTOR=OWL>. 

• Harry <ACTOR=HARRY> slowed down, searching for one par- 
ticular issue that he ( ^he = HARRY) knew had arrived shortly 
after he ( ^he = HARRY) had returned to Privet Drive for the 
summer, he (^he = HARRY) remembered that there had been 
a small mention on the front about the resignation of Charity 
Burbage, the Muggle Studies teacher at Hogwarts. At last he 
(^he = HARRY) found it.</ACTOR=HARRY> 

where linguistic anaphors are solved depending on the current ac- 
tor, the gender, and/or between different candidates that have already 
occurred in the text. Knowing that the owl has the name Hedwig, 
and moreover, having a hierarchy on site, the zoning can be updated 
even more. An additional analysis can be performed to gain further 
information about the separate zones. Rather simple to extract infor- 
mation could be statistics about the size and layout of zones, but a 
more sophisticated analysis of their text content is possible. The lat- 
ter can lead to an extraction of the semantic content and purpose of 
a zone. Such kind of inforination can be used for various purposes, 
such as comparing documents to each other (regarding their analysed 
zone structure and content) by simply refering to information about 
the zones as zone variables (|3|). 

We apply content zoning to text streams in order to establish a 
semantic mind-map, while using a sliding window of user-specified 
length; text is buffered in, immediately zoned and analysed. Inter- 
mediate statistical results are managed to produce a user specific 



summary on a given subject. The definition of zones is insufficient 
to an effective zoning as zones only indicate a position of an in- 
formation in the text, but do less give information about the con- 
tent. A solution might be the introduction of zone variables, which 
describe the content of the input stream and are the core parame- 
ters for the summary generation. In this respect, we concern with 
two categories of zones, namely document independent and 
document dependent zones, for example the position of the 
zone in the text stream, its length or the most occurring word (with- 
out stopwords). During the zoning process, nearly all sentences are 
attributed to zones. Useless sentences and sentences which cannot be 
attributed to a zone are skipped or can be regrouped a user-defined 
zone. After different steps as anaphor resolution for example, the val- 
ues of the variables are used for statistical evaluations to generate a 
summary on a user specific subject. During the process of buffering 
and processing the text streams, there is an option of real-time eval- 
uation, so that changing values are immediately visible to the user. 
At each instant, the user has the possibility to call a summary on a 
previously defined subject, as for example an actor of a fairy tale. 
But, also less sophisticated results as the top most occuring words or 
collocations can be called at each moment. 

3 CONCLUSIONS 

A mind-map is an adaptive engine that basically works incrementally 
on the fundament of transactional streams. Following our model, 
mind-maps consist of symbolic neural cells that are connected with 
each other and that become either stronger or weaker depending on 
the transactional stream: based on the underlying biologic principle, 
these symbolic cells and their connections as well may adaptively 
survive or die, forming different cell agglomerates of arbitrary size. 

With that, mind-maps may be applicable for the management of 
trust as well: every human shares an own attitude about others, for 
example Person R has an attitude to Person S and vice versa. 
Both are probably different from each other and different to Per son 
T's view to Person R or Person S. Furthermore, one might get 
the conclusion if Person R trusts Person S but not vice versa. 
With this approach, we may follow a proposition by |16| who sug- 
gests a model on human conversations. The attitude of someone's 
mind is modelled as a self-organising mind-map. Every person has 
a model of his/her view of the world and models for other people 
whom he/she has interacted with. All views including a self view 
and views to other persons (others) will be modified through conver- 
sations between people. We therefore intend to introduce an engine 
to find regularities between human's objects and to model trust based 
on this. The creation of an artificial mind-map, where a textual data 
stream is read and represented in an associative dynamic network. 
Each incoming stream is decomposed to its items, for example a text 
stream may be decomposed to words. 
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